Epidemic spreading induced by diversity of agents' mobility 
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In this paper, we study into the impact of the preference of an individual for public transport on 
the spread of infectious disease, through a quantity known as the public mobility. Our theoretical 
and numerical results based on a constructed model reveal that if the average public mobility of 
the agents is fixed, an increase in the diversity of the agents' public mobility reduces the epidemic 
threshold, beyond which an enhancement in the rate of infection is observed. Our findings provide 
an approach to improve the resistance of a society against infectious disease, while preserving the 
utilization rate of the public transportation system. 
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I. INTRODUCTION 

The advent of modern transportation systems has 
enhanced the mobility of mankind and has increased 
their range of travel. At the same time, it has inten- 
sified the contact between human beings because of the 
higher human density within transportation systems re- 
sulting from a confluence of people within limited phys- 
ical spaces. The close proximity between travellers pro- 
vides an opportunity for diseases to spread and it is well 
known that infectious disease is the main cause of death, 
disability, as well as social and economic disruption that 
affects millions of people [H-Q- To stop the prolifera- 
tion of infectious disease and their spread, researchers 
have searched for ways to hinder their diffusion A 
typical strategy is to adjust the level of human contact 
through the temporary closure of companies and edu- 
cational institutes. This strategies, however comes at a 
very high price for both society and economy. 

In this paper, we have focused our research on epidemic 
spreading in public transportation system, with the aim 
of understanding how the disease spreads within such a 
system so that mitigating strategies can be determined to 
reduce its social and economic impact. Human mobility 
relates to the activity of moving from one point in space 
to another and can be measured by the frequency and 
distance of travel [Icl - ll4j |. Such human movements have 
been enhanced by public transportation network which 
is an indispensable component of the major metropolitan 
area of a country. For example, the Mass Rapid Transit 
(MRT) system in Singapore has a daily load of around 



700,000 passengers (i.e. 15% of the total population) 
15]. About 90% of Hong Kong citizens rely on public 
transport facilities for commuting with the main concern 
being exposure to airborne pollutants within the pub- 
lic vehicles fl6j . The large flux of commuters in public 
transportation system has typically led to extreme over- 
crowding, especially during peak hours. The resulting 
high rate of human contact implies a high rate of trans- 
missibility of infectious diseases. For example, the risk 
of contracting pulmonary tuberculosis in Peru is higher 
by a factor of 4.09 for those commuting by minibus com- 
pared to those traveling by private transportation [l7j . In 
consequence, commuters tend to avoid public transporta- 
tion during an epidemic outbreak. They choose either to 
stay at home or to commute by private transport. The 
outcome is undesirable: a severe shortage of manpower 
in the workplace and the possible occurrence of major 
traffic congestion. 

In the past decades, there is a lot of interest in studying 
the spread of epidemics within complex networks, which 
includes: (i) the influence of network structure on epi- 
demic spreading [l8-22j. (ii) the development of immu- 
nization strategies [51471, 123I425B. and (hi) epidemic spread- 
ing in community networks [26h28|, in dy namic networks 
[29j-|32j and in adaptive networks [33l436l |. This has mo- 
tivated us to construct a model on epidemic spreading 
that relates to public transportation system. It is known 
that commuters have diverse preference in choosing their 
mode of travel [37j based on their socioeconomic status. 
As a result, the frequency of different agents using the 
public transportation system may differ. In this paper, 
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we shall denote the frequency of utilizing public transport 
for travel as "public mobility" . Individuals with high 
public mobility use public transports very often, while 
individuals with low public mobility hardly use the pub- 
lic transportation system. This dichotomy in the usage 
of public transport prompts the following question : how 
does the diversity of public mobilities affect the speed of 
epidemic spreading? The purpose of this work is to give 
a definite answer to this question. 

The structure of our paper is as follow. The details of 
the model is discussed in Sec.|TT]of this paper. In Sec. Mil 
of the paper, we provide a theoretical analysis that en- 
ables us to determine the lower bound of the epidemic 
threshold. Then, in Sec. lIVi we present simulation re- 
sults which are found to support our theoretical analysis. 
Finally, we end our paper with a discussion and conclu- 
sion in Sec. El 



II. MODEL 

In order to gain a better understanding on our model, 
it is useful to first study a simple model, which can be 
regarded as a null model that serves the purpose of a 
benchmark and validity check. In this simple model, a 
square with length L satisfying periodic boundary con- 
dition is used to represent a society. There are N agents 
in the square. The positions of the agents are randomly 
assigned with a uniform distribution and for the sake of 
simplicity, are assumed to be fixed over time. We also 
assume that agent only interacts with agents who are 
located at a distance that is less than r away. In other 
words, links appear between all pairs of agents whose dis- 
tance from each other is smaller than r. Since the agents 
are fixed in their position, the links that are established 
in this way do not change with time. This null model is 
simple, and its similar forms have also been adopted in 
other works [28|, [29[ . In this model, the average degree 
of the network of agents in the square: (fc), which is de- 
fined as the average number of links that an agent has, 
is approximately given by Nirr 2 /L 2 , and the degree dis- 
tribution of this model satisfies the binomial distribution 
p(k) = C%q k L (l - q L ) N ~ k with q L = Nnr 2 /L 2 . As the 
ratio of the standard deviation of the distribution to its 
mean value tends to zero when N — > oo, we expect the 
degree of the network connection between agents to be 
homogenous. 

In this paper, we use the SIS model to describe the epi- 
demiological process, which is widely adopted to describe 
infectious diseases [3q-l4ll]. In this model, agents can be 
in either of two distinct states: susceptible or infected. 
A susceptible agent may become infected if there are in- 
fected agents within the interaction radius. Suppose a 
susceptible agent has k neighbors within its interaction 
region, of which fcj n f are infected, and the probability 
of being infected by each infected neighbor is p, then the 
probability that agent becomes infected is [1 — (1 — p) kint }. 
At the same time, each infected agent can recover from 



the disease and becomes susceptible. We assume that 
this occurs at a rate of /i. When the ratio pj \x is fixed, 
different pairs of p and fi only affect the definition of the 
time scale of the disease propagation [4l|. Therefore, we 
can set p and /i to be sufficiently small so as to use the ap- 
proximation [1 — (1 — p) kint ] — > pk m f. In other words, we 
can maintain the same results (except for the time scale) 
as long as the ratio p/ fj, is fixed. This approximation has 
been widely adopted in the literatures (see Ref. [35l l42j ) . 

In this paper, we have fixed r = 0.02, p = 0.1, = 0.2. 
In the simulations, all the averaged results and their stan- 
dard deviation (which is indicated by the error-bars) , are 
obtained from 1000 different realizations, if not otherwise 
specified. 

The epidemic threshold of the null model is determined 
by the basic reproductive number Rq with [H, |43| 

Ro=p(k)/n. (1) 

When i?, < 1 the infection dies out in the long run, and 
when Ro > 1 the infection may spread over the popu- 
lation. This condition leads to a critical average degree 
(k) th = ^ and correspondingly the critical number N th 

for a given set of r, p, u and L with N th = 

Now we are ready to introduce our model which fo- 
cuses primarily on the public transportation system. In 
order to study epidemic spreading in public transporta- 
tion system, we have separated the society into two parts: 
the public transportation system (A) and the rest of the 
society (B) (see Fig.QJ. Each part is represented by a 
square which satisfies periodic boundary condition. The 
length of square A is La and the length of square B is 
Lb- Since human contacts within public transportation 




Probability = 1 



FIG. 1. Schematic illustration of the model. The smaller 
square represents the public transportation system and the 
larger square represents the rest of the society. At the begin- 
ning of each time step, each agent may transit to the public 
transportation system with a probability that equals his pub- 
lic mobility. At the end of each time step, all the agents in the 
public transportation system return to the society. Note that 
an agent in darker gray possesses a higher public mobility. 
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system is typically denser, we have set La <C Lb- There 
are a total of N agents in the society, each (labelled by 
the index e) with a public mobility (denoted as "PM" in 
the following) of m e that ranges between and 1. Our 
model begins by assigning a random position for each of 
the N agents in square B. At the beginning of each time 
step, agent e either transits to square A with probability 
m e and then chooses a random position there to stay, or 
remains in square B at the originally assigned position. 
At the end of each time step, all the agents that transit 
to square A return to their original position in square B 
and the whole system prepares for the next time step. 

Similar to the null model, each agent has a contact 
radius of r and a link between two agents is formed 
whenever they are within this radius. The average de- 
gree of agents in square A (B) is (Us) a = Nattt 2 jL\ 
((&)b = NsTrr 2 /L B ), where Na (Nb) is the number of 
agents in the square A (B). 



III. THEORETICAL ANALYSIS 

In this section, we provide a theoretical analysis on the 
epidemic threshold of our model. Suppose p(m) is the 
fraction of agents with PM m such that J p(m)dm = 1. 
Then, the average PM m = J p(m)mdm and the second 
moment D = f p(m)m 2 dm. Let us denote i(m,t) as the 
fraction of infected nodes with PM m at time step t. 
Since i(m,t) represents the fraction of infected agents in 
both square A and B at time t, we expect the evolution 
of i(m, t) to consist of two parts: 

i(m, t + l) = U A (m,t + l) + n B (m, t + 1) . (2) 

Here, Ha(tti, t + 1) denotes the fraction of infected agents 
with PM m at time t + 1 as a result of having visited 
square A at time t. It can be expressed as follow: 

n^(m, t + 1) = i(m, t) ■ to 

+[p(m) — i(m, t)] ■ m ■ pk\^ (i) 
—p:i(m, t) ■ m , (3) 

where k\^ (t) is the average number of infected neighbors 
of an agent in square A at time t and is defined by: 



*$(*) = <*>. 



J i(m' , t)m'dm' 
J p(m')m'dm' 



(4) 



The first term on the right hand side of Eq. ([3]) denotes 
the fraction of infected agents with PM m that have 
moved to square A at time t. The second term on the 
right hand side denotes the fraction of susceptible agents 
with PM 777 who are infected at time t due to their transit 
to square A. The third term on the right hand side repre- 
sents the fraction of infected agents with PM m who have 
recovered from the infected state and are now in the sus- 
ceptible state, as they traverse to square A at time step 
t. On the other hand, n^m,/; + 1) gives the fraction of 
infected agents with PM m at time t + 1 in lieu of having 



remain in square B at time t. Similar to 11^4 (m, t + 1), 
II 73(777, t + 1) takes the following form: 

IIj3(m, t + 1) = 7(777, t) ■ (1 — m) 

+[p(m) - i{m,t)] ■ (1 - to) -pk\J{t) 
—p,i{m, t) ■ (1 — m) , (5) 

where k- J (t) is the average number of infected neighbors 
in contact with an agent in square B at time t and it is 
defined by: 



fi(m',t)(l - m')dm' 
1 — J p(m')m'dm' 



(G) 



In the steady state, we expect i(m,t + 1) = 7(777, t) = 
i*(m). Thus, we have 



—i* (777) = \p(m) — i* (m)l • 777 • fc.^Vft) 
p 

+ [p(m)-e(m)}.k^ it ) 



- [p(m) - i* (m)] ■ to • k> ni ' (t) . (7) 

Multiplying both sides of the equation by m and then 
integrating throughout with respect to m, we obtain 



D 



\{k)A 


+ Mb 


m 


1—777 


\(k)A 


(k) B 


m 


1—777 



(k) b _ n 

^ = 777(A 



1—777 



p 



(8) 



where A = i*/Q and 77 = Q/ft for i* = J i*(m)dm, 
= Ji*(m)mdm and fl = f i*(m)m 2 dm. It is easy to 
see that A — 1 > and rj — 1 > when m G [0,1]. Thus, 
the terms in the square bracket on the left hand side of 
Eq. (JSJ) is positive. In order for the fraction of infected 
agents to be non-zero, i.e. i* > and O > 0, the right 
hand side of Eq. (JSj) has to be positive. Hence, 



D 



(k) A (k)i 



1 — 777 



(A-l) 



{k) B _ M 1 , p. 
- — —m{\ — 1) > — 

1 — 777 p 



(9) 



Eq. © indicates the presence of a lower bound for D, 
which is 



D 



P 



^ = m ( A _ l ) 



(fc) -^(A-l) 



(10) 



1 — 777 



When (k) b <C (k)A and (k)s is small, Eq. (|TU1) can be 
approximated by 



p{k). 



(11) 



Since = Nj ^Y = mj Y"' , ID can also be expressed in 
A . A 

terms of N and La- This expression has the implication 
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that the value of D_ remains vmchange as we scale the 
variables N and L 2 A by the same factor. 

Let us next consider the case where (k) b is not ne- 
glected in Eq. (|10p . Since human contacts in public trans- 
portation system is typically denser, we anticipate square 
A to dominate the infection process. Therefore, agents 
that transit frequently to square A have a larger probabil- 
ity of being infected. Hence, we expect i*{m) ~ p(m)m, 
and therefore A = i*/Q = J p(m)mdm/J p(m)m 2 dm = 
m/D. Thus, Eq. {TDJ becomes 



D 



P 



to — 

1 - m \D 



1 



(k) A (k) 



m 

1 

m \ D 



(12) 



Solving Eq. (|T2"j) for D_ , we obtain 



D 




2m- 



1 — m 



_ 4 _ 2 (k)B ( {k) A , (k) B 



1 — to 



1 — m 



2m 



2 

(k) B 

1 — TO 



(k)A , (fc> 



+ 



TO 



By using (k) A = 



m 



mNirr 

s — 



(13) 



and (k) b = 



'i , we can 



also express D via N, L A and is. 

Similar to Eq. (|11[) , _D is found here to remain invariant 
when N, L\ and L 2 B are varied by the same scaling factor 
if TO is fixed. In fact, Eqs. ([TT|) and (|13|) allow us to obtain 
the lower bound of the variance of PM: 



a 1 = D - m 2 . 



IV. SIMULATION RESULTS 



(14) 



Before we show the influence of the diversity of PM 
on epidemic spreading, let us first study the simple case 
when all the agents have the same PM value to. Wc 
suppose the system contains N agents. When the PM of 
all the agents is to, the number of agents in square A in 
each time step is N A = mN, while that in square B is 
Nb = (1 — to) AT. Therefore, the long time average of the 
degree of all the agents is (k) — m ■ ml ^J r + (1 — to) • 

(l-m)Nirr 2 1 • 1 • • . i 2Nirr 2 u r ^ 

^ , wnicn is approximately to A when L A <C 

B A 

Lb- By adopting Eq. ([T]), we can obtain the threshold 
m th as follow: 



TO th = 



T2 

A 



p Nnr 2 



(15) 



Note that above this threshold, the system may become 
endemic. 



It 

and to" 1 are 



Figure [2] shows the fraction of infected agents i* at the 
steady state versus the value to and (k) for the case when 
the PM of all the agents is to (circle symbols) and the null 
model (square symbols), respectively. By using Eqs. JT]) 
and (dSJl, we have (fc) th = 2 and to*' 1 = 0.26 (detailed 
parameters are indicated in the caption of the figure 
shows that our theoretical estimates on (fc) th 
in accord with the simulation results. Moreover, it shows 
that when (k) > (k) th , the value of the circle symbols 
may be much larger than that of the square symbols, 
which indicates that the extent of the epidemic preva- 
lence is strongly enhanced by the public transportation 
system. In the following, we shall show that a modifica- 
tion to this threshold behavior can occur when we take 
the diversity of the agents' PM into consideration. 

Let us begin by considering the case in which all the 
agents belong to either of two groups: G\ or G*2, with 
all the agents in each group having the same PM. The 
PM of the agents in G\ (G2) is toi (TO2), and the size 
of the group is N\ (A^). Thus, the size of the system 
N = Ni + N 2 , the average PM m — {N x -mi +N 2 -m 2 )/N, 
and the variance of the PM a 2 = (N\ ■ m\ + N 2 ■ 
m 2 )/N — rfl 2 . At each time step, the expected num- 
ber of agents in G\ (G 2 ) transiting to square A is equal 
to miNi(m 2 N 2 ). Hence, on average, square A con- 
tains N A = toiATj + m 2 N 2 = mN agents, and square 
B contains Nb = (1 — to)A^ agents. Therefore, (k) A 
and {k)s can be expressed as (k) A — mNirr 2 /L A and 
(k)s = (1 —m)N-KT 2 1 L 2 B , respectively. Moreover, after 
the process of time averaging, the degree of agent e with 
PM m e is k e = m e (k) A + (1 — m e ){k) b- 

In order to demonstrate the effect of diversity in PM 
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FIG. 2. The open circle symbols shows the fraction of infected 
agents i* at the steady state versus the average degree (k) (the 
lower abscissa) and the average PM m (the upper abscissa) 
for the case that the PM of all the agents are equal to m, 
with N = 1500, L A = 0.25 and L B = 10. The solid square 
symbols illustrate the case for the null model for the sake of 
comparison, with L — 1. 
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on epidemic spreading, we first set mi = m,%. After 
that, we decrease m\ and increase such that to re- 
mains unchanged. This operation increases a 2 from 
without changing m. Fig.^a) illustrates the manner in 
which the PM of the two groups are tuned. Fig.[3jb) 
shows the fraction of infected agents i* at the steady 
state as a function of ni2- When mi = m 2 = 0.2 (i.e., 
a 2 = 0), the system is in a disease free state. When 
W2 exceeds the threshold: ~ 0.568 which is deter- 
mined from Eq. (TTT1) (note that the rest of the parame- 
ters are indicated in the caption of the figure), the sys- 
tem becomes endemic. Compared with the results shown 
in Fig. [2] and Eq. (TIBl , this example shows that the di- 



1.00 




0.128 



FIG. 3. (a) The fraction of agents p against the PM m. The 
two bars in the figure indicate the presence of only two groups, 
with each group having different PM. This figure illustrates 
how the standard deviation of the PM is tuned without chang- 
ing the average PM, which is m = 0.2, as indicated by the 
dashed line, (b) The fraction of infected agents i* at the 
steady state against the PM ni2 and a 2 . The inset shows 
the detailed behavior of i* near the zone of transition. In 
this case, Ni = 1250 and N 2 = 250. Note that all the other 
parameters take the same value as those employed in Fig.0 



versify of the PM can induce epidemic spreading, even 
when m is smaller than the m th in Eq. fjl5j> . Since in 
this case La <C Lb which makes (k)A ^ we can 

use Eq. (TTTj) to calculate the lower bound of the variance 
a 2 . Our calculation gives D_ = 0.067, and correspond- 
ingly = D_ — m 2 = 0.027. This figure shows that our 
simulation results are in accord with the theoretical es- 
timate on the epidemic threshold, above which a finite 
fraction of the infected agents is found to exist. Thus, 
for a fixed to, there exists a threshold for the variance 
of the PM, exceeding which the epidemic spreads and 
the system becomes endemic. Beyond the threshold, the 
fraction of infected agents increases as <r 2 increases. 

In a more general setting, the PM of the population 
may follow an arbitrary distribution. In order to study 
our model in this more general situation, we first need to 
develop an approach to assign PM to agents following a 
given distribution, with to and a 2 tunable in this method. 
The details of this method are presented in the appendix. 

With the observation that the commuters behavior 
and their traveling properties such as distance and time 
interval between journeys are found to be characterized 
by power- law distribution [TT| - [T3 |. we here assume that 
PM follows the distribution: p(m) ~ mT a . (Note that 
our conclusions do not rely on any particular form of 
p(m)). By utilizing the method introduced in the ap- 
pendix, we can tune to and a 2 without changing the form 
of p(m). The results on the fraction of infected agents 
i* at the steady state in the to - a 2 plane are shown in 
Fig. H](a), where the exponent of the power-law distribu- 
tion is a — 3. In this figure darker grey levels indicate a 
larger fraction of infected agents. We observe that for a 
given g 2 , i* increases with an increase in fn, which can 
be understood as follow. A larger to means the transit 
of a larger number of agents to square A. This implies 
a higher average degree for all the agents since (k)A is 
greater than (k) b ■ The outcome is an increase in the 
contacts between humans within the society. Thus, epi- 
demic threshold reduces and disease spreads more easily. 
In particular, when the threshold for a 2 drops to zero, the 
system can still be endemic as long as m is large enough. 
The presence of a threshold here for a 2 is similar to the 
two-mobility group model that we have discussed earlier. 
Just like the two-mobility group model, we observe that 
as a 2 increases beyond a threshold, the fraction of in- 
fected agents increases as the variance increases. Note 
that the solid line in the figure is obtained from Eqs. (fT3|) 
and (TTJ| which denotes our theoretical estimate of the 
epidemic threshold. We can see that the theoretical esti- 
mation conform with our numerical simulation results. It 
is important to note that the theoretical analysis above 
indicates that D ~ 9 and through Eq. (f?]), reveals that D 
is proportional to the probability that a link is infected. 
In other words, for a fixed to, a larger a 2 (see Eq. (TPfj) ) 
implies a larger probability that a link is infected. This 
explains the observation as duly shown in Figs. [3] and 0] 
that increasing a 2 , i.e. the diversity of PM, invariably 
increases the fraction of infected agents. It thus clarifies 
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our inference that diversity in PM has the effect of reduc- 
ing the threshold of an epidemic outbreak. Figure 0] (b) 
shows the fraction of infected agents i* at the steady state 
as a function of a 2 when m = 0.2 for N = 1000, 5000 and 
10000. In all the three cases, we have kept the value of 
N/L\ and N/L 2 B constant. We observe that o£ ~ 0.01 
for all the three cases as indicated by the black arrow, 
which is consistent with the analytical results obtained 
from Eqs. (fT3|) and (fl"4l) . Moreover, when a 2 > o£, i* is 
observed to have very similar monotonically increasing 




#=1000 
A=5000 
#=10000 



FIG. 4. (Color online) (a) A gray-scale plot on the fraction of 
infected agents V at the steady state in the m-a 2 plane. The 
sizes of the error-bars are small and hence are not shown here. 
The parameters employed are: a = 3, N = 2000, G = 20, 
La = 0.25 and Lb = 5. (b) The fraction of infected agents 
i* at the steady state as a function of a 2 when m = 0.2 for 
N = 1000, 5000 and 10000. In all the three cases, N/L 2 A 
and N/L 2 B are maintained at the same value. That is (i) 
N = 1000, L A 0.177, L B - 1 3.54, (ii) N = 5000, L A ^ 0.4, 
L B ^ 7.9, (iii) N = 10000, L A ^ 0.56, L B ^ 11.2. As a result, 
{k)A — 8 and {fc)s — 0.08 for all the three cases. The inset 



behavior for the different number of agents N. 



V. DISCUSSION AND CONCLUSION 

In summary, we have studied into the effects of diver- 
sity in public mobility on epidemic spreading by propos- 
ing a model which separates a society into two parts: the 
public transportation system and the rest of the soci- 
ety. In our model, we have defined public mobility as the 
probability of an agent in the society who opts to take 
public transport at each time step. Our results show 
that a larger diversity in public mobility gives rise to 
a smaller epidemic threshold. Taking into account the 
inevitable diversity in socio-economic status among indi- 
viduals within a population, we have come to the conclu- 
sion that if we are able to control the diversity in human 
behavior, we would be able to enhance the resistance of a 
society against the onslaught of a pending epidemic. Our 
results show that this can be achieved without reducing 
the average public mobility of a society by encouraging 
the population to use both public and private transport 
with uniformity and without biasedness. For example, 
commuters with low public mobility are persuaded to 
take public transport more regularly while commuters 
having high public mobility are urged to travel in private 
transport with greater frequency. In this way, epidemic 
spreading can be slowed down without causing any traf- 
fic congestion as well as any disturbance to the proper 
functioning of the public transportation system. 



Appendix: A method of tuning m and a 2 

In this appendix we report the details of a method 
of tuning m and cr 2 , while maintaining the PM of the 
population according to an arbitrary distribution given 
by p(m) as follow. 

First, we distribute all the N agents evenly into G 
groups so that there are N/G agents in each group. 
Agents in the same group have the same PM, while agents 
in different groups may have different PM. Public mo- 
bility of agents in group j is rrij, with j = 1, • ■ ■ , G. 
The upper and lower bounds of the PM are m m i n = mo 
and m max = mo, respectively. We define F(m) to 
be the primitive function of p(m), such that F(m) — 
f™ g p(m')dm'. Then, we assign PM to the agents ac- 
cording to the following recurrent relations: 

F(mj) - F( mj _i) = ~(F(m G ) - F(m )), 



G 



for j = !,••• ,G. (A.l) 



Given mo and ma, each rrij can be obtained by solv- 
ing Eq. (|A.1[) from j = 1 to j = G, from which m 
shows the zoom in results for the range a 2 € [0.010,0.013]. and a 2 can be determined. Since p(m) is positive for 
The results in this panel are obtained from 5000 different m g [0, 1], rrij can only increase monotonically with 
realizations. Note that the values of the other parameters are j The meaning of Eq (jA.ip can be understood in 
the same as those used in (a). the following way . Based on our definition, p(m) = 
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[F (rrij) — F (m^-i)] /Am with Am 



0. 



This implies that p(m) = Cj (GAm), where C is a con- 
stant equals to F (ma) — F (mo), according to our con- 
struction. Then, for a well defined p(m), we would expect 
G — » oo as Am — > 0. In other words, the distribution of 
PM generated by our approach becomes accurate and 
tends towards the distribution p(m) as G — > oo. 

Furthermore, Eq. (|A.1[) can be easily extended to the 
more general situation of each group having a different 
number of agents. Suppose the size of group j is rij, then 
Eq. (jA.ip can be generalized to 

F( mj ) - F(m H ) = -jt(F(m G ) - F(m )), 

for j = !,-■■ ,G. (A.2) 

However, we shall restrict our investigation to the con- 
dition of same group size as we explore into the effect of 
the diversity of PM. 

Suppose PM follows the distribution: p(m) ~ m~". 
(Note that our formulation based on Eqs. (|A. 1|) and (|A.2[) 
allows p(m) to take any generic form). Then, by means 
of Eq. (|A.1|) . we obtain the PM of agents in group j as 
follow: 



G 



G 



(A.3) 
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The average PM is given by 




m=- 



G 



G 



Gm n 



Ot JTlr 



,G 



(A 



where Q(fi,p,N) = J2f=i(P + J 



)^ is the truncated form 
of the generalized £- function [44j. Similarly, the sec- 
ond moment D of the distribution can be obtained from 
Eq. (|A.3[) as follow: 



„ 1 / m 
D = — 
G 



G 



GmJ" Q 



1 — a m 



-,G 



G 



FIG. 5. (a) The relation between the group index j and the 
corresponding PM rrij is plotted in log-log scale, where a = 3, 
4 jn = 0.1, m G = 0.9, G = 1000 and N = 10,000. The size 
of each group is N/G = 10. Dashed lines are plotted for 
reference, (b) Histogram of the number of agents with PM 
within the range [mo, wg]- By separating the range [mo, tug] 
into 10 parts, the width of each bar is 0.8. Note that an agent 
with PM within the abscissa of a particular bar is counted 
towards the height of that bar. Fhe plot is in log-log scale. 
We have plotted a straight line with a slope of —3 to serve as 
a guide for reference. 



(A.5) 



We observe that m and a 2 = D — m 2 are functions of 
mo, ma and G. Hence, we can adjust the distribution of 
PM by varying the values of m and m G so as to obtain 
different m and a 2 for a given G. Figure[5] (a) shows the 
relation of the group index j and the corresponding PM 
rrij of a power-law distribution with a = 3. In this case, 
we have set = 10000, G = 1000, m = 0.1, m G = 0.9, 
m = 0.18 and a 2 — 0.012. In this figure, we observe that 
about 1% of the total number of groups has PM larger 
than 0.7, while about 90% of the groups has PM smaller 
than 0.3, which manifests a strong heterogeneity in the 
PM. Figure[5] (b) shows the corresponding histogram of 
PM p(m) of the generated sample. We have performed 
a maximum likelihood estimate of the exponent for the 
distribution obtained from Eq. ()A.3|) . The value of the 
most likely exponent is 3.08 ± 0.02 [44T |. which shows a 



good fit to the value of the target exponent, which is 3. 
The slight difference between the actual and target ex- 
ponent results from G being finite, as was pointed out 
above for a non-trivial situation, i.e. a ^ 0, it is only 
when G — > oo that the actual and target exponent have 
a perfect match. We have plotted a straight line with 
a slope of —3 to serve as a guide for reference. These 
results demonstrate the effectiveness of the method. By 
tuning mo and m G , we can obtain different values of m 
and a 2 . We note that our purpose here is not only to 
assign PM to the agents following a certain distribution 
but more importantly to find a way to adjust m and 



a , hence here pirn) 



a could also serve as a sim- 



ple and effective auxiliary function for adjusting m and 
a 2 . In consequence, as long as m and a 2 are obtained 
correctly, we do not expect the slight difference between 
the actual and target exponent to affect our conclusions. 
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We also note that when a power-law distribution is un- 
bounded, the mean and variance of the distribution can 
be infinite if the exponent satisfies certain conditions [3] ■ 
Under these circumstances, any mean and variance ob- 
tained from a set of samples of such a distribution is not 
meaningful because the fluctuation of these quantities 
can become exceedingly large [Hj]. However, the PM in 
our model is bounded within [0, 1]. Therefore, the mean 
and variance of the distribution obtained from Eq. (|A.3[) 
are finite. In this situation, we expect the sample mean 
and sample variance to converge to the population mean 
and population variance of the distribution respectively, 



as the number of samples tends to infinity. Thus, the 
mean and variance of the distribution of our model ob- 
tained from Eq. (|A.3j) are reliable. 
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